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present invention provides improved methods for manipulating recombinant DNA in ^^^.^^^ 
More specifically, the invention provides methods capable of altering a nucleic acid sequence present at the termini of a target se- 
quence. 
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TTTTiF OF TWF TWVEWTlONl 

USE OF EXO-SAMPLE NUCLEOTIDES IN GENE CLONING 

FTFT.T) OF THT! TNVENTIONt 

The invention relates to improved methods for 
5 manipulating recombinant DNA in gene cloning and 
expression. More specifically , the invention provides a 
method in which exo-sample nucleotides are used to alter 
either the 3' or 5' terminus of the nucleic acid sequence 
of a target sequence. 

10 BACKGROUND OF THE INVENTION ; 

Recombinant DNA methodologies capable of amplifying 
purified nucleic acid fragments have long been recognized. 
Typically, such methodologies involve the introduction of 
a desired nucleic acid fragment into a DNA or RNA vector, 
15 the clonal amplification of the vector, and the recovery 
of the amplified nucleic acid fragment. Examples of such 
methodologies are provided by Cohen et al. (U.S. patent 
4,237,224), Maniatis, T. et al. , Mplecular Cloning: A 
T.^rvK-^rvrv Manual . Cold Spring Harbor Laboratory, 1982, 

20 etc. 

In some instances, the desired nucleic acid molecule 
can be readily obtained from a source material. The 
molecule can then be inserted into a suitable vector by 
either adding "linker molecules" (see scheller et al ., 
25 science 19£: 177-180 (1977)) or by treating the desired 
molecule with a restriction endonuclease . 
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in other instances, however, the desired nucleic acid 
molecule cannot be obtained from a source material at a 
concentration or in an amount sufficient to permit gene 
cloning. In such a situation, it is necessary to amplify 
5 the nucleic acid molecule by, for example, template- 
directed extension, prior to introducing it into a 
suitable vector. Primer extension can be mediated by the 
"polymerase chain reaction" («PCR«), or other means. 

in the "polymerase chain reaction" or »PCR« the 
XO amplification of a specific nucleic acid sequence is 
achieved using two oligonucleotide primers complementary 
to regions of the sequence to be amplified (Figure 1) . 

The polymerase chain reaction provides a method for 
selectively increasing the concentration of a nucleic acid 
15 molecule having a particular sequence even when that 
molecule has not been previously purified and is present 
only in a single copy in a particular sample. The method 
can be used to amplify either single or double stranded 

DNA. tJt . . 

20 Reviews of the polymerase chain reaction are provxded by 
Mullis, K.B. r -i* snrina Harbor Svmp. Onmrit. Biol- 
51:263-273 (1986)); Saiki, R.K. , e£_iLU (B WT?chnoloqv 
3:1008-1012 (1985)) ; Mullis, K.B., afcjJ-. Enzvmol. 
1^:335-350 (1987); Erlich H. «fc_jLU, (BP 50,424; EP 

25 8 4,796, EP 258,017, EP 237,362); Mullis, K. (EP 201,184); 

Mullis K. «t al. . (US 4,683,202); Erlich, H. (US 
4,582,788); and Saiki, R. S*L_ajU (US 4,683,194) all of 
which references are incorporated herein by reference) . 

The ability to incorporate a gene sequence into a 

30 suitable vector is typically performed using restriction 
endonucleases. Thus, the vector and the desired gene 
sequence are treated with a restriction nuclease capable 
of producing compatible termini which can then be ligated 
together to form a covalently closed vector molecule. 

35 Preferably, the restriction enzyme is selected such that 
its recognition site is not present in the desired gene 
sequence . 
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It would be desirable to be able to generally alter 
the nucleotide sequences of a desired target sequence in 
order to permit it to be cloned into a suitable vector 
without using oligonucleotide linkers /adapters, and 
regardless of the availability or suitability of 
restriction sites. The present invention provides methods 
suitable for accomplishing these goals. 



S UMMARY Q V TUTi! INVENTION: 

The present invention provides improved methods for 
10 manipulating recombinant DNA in gene cloning and 
expression. More specifically, the invention provides 
methods capable of altering a nucleic acid sequence 
present at the termini of a target sequence. 

In detail, the invention provides a method for 
15 incorporating a double-stranded linear desired nucleic 
acid molecule into a double-stranded vector, comprising: 

(A) forming a modified desired nucleic acid molecule 
characterized in possessing a first region of pre-selected 
sequence at at least one terminus of a first strand, the 

20 sequence containing at least one dU residue; 

(B) treating the first region of pre-selected 
sequence under conditions sufficient to result in the 
removal of the uracil base of at least one of the dU 
residues, to thereby form a protruding terminus capable of 

25 hydrogen bonding to a complementary sequenceboth of the 
strands, on at least one strand of the modified desired 
molecule; 

(C) incubating the modified molecule (B) in the 
presence of a modified vector having at least one 

30 protruding single-stranded terminus, and being capable of 
hydrogen bonding to at least one of the protruding 
terminus of the modified desired DNA molecule, to thereby 
incorporate the double-stranded linear desired nucleic 
acid molecule into the double- stranded vector. 
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The invention also provides the embodiment of the 
ab ove method wherein only one terminus of the modified 
desired molecule contains the dU-containmg sequence 

The invention also provides the embodiments of the 
5 above methods wherein the terminus is a 3- terminus, or a 
5 . terminus of the modified desired molecule. 

The invention also provides the embodiments of the 
abo ve methods wherein two termini of the modified desired 
molecule contain the dU-containing sequence The 
L0 invention also provides the embodiments of the above 
method wherein both of the termini are 3 • termini or both 
of the termini are 5' termini of the modified desired 

The invention also provides the embodiments of the 
« above methods wherein the termini of the first and second 
strands of the desired DNA molecule contain a plurality of 

dU residues. ^ 

The invention also provides the embodiments of the 
above methods wherein in step (B) , the an residues are 

20 treated with UDG under conditions sufficient to 

uracil base of at least one of the dU residues, to therby 
^ an abasic site, or wherein in step <B, additionally 
comprises treating the abasic site with 
under conditions sufficient to cleave the modified desired 

25 molecule at the abasic site. . 

The invention also provides the embodiments of the 
above methods wherein the regions of pre-selected sequence 
of the modified desired DNA molecule are identical. 

The invention also provides the embodiments of the 

30 above methods wherein in step (C) , the two protruding 
single-stranded termini are produced through the action of 
a restriction endonuclease, or through the ligation of an 
oligonucleotide to the vector or by (I) adding to the 

vector: , 
35 (i) a first region of pre-selected sequence at a 5 

terminus of a first strand, the sequence containing at 
least one dU residue; 
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(ii) a second region of pre-selected sequence at a 5' 
terminus of a second strand, the sequence containing at 
least one dU residue; and 

(II) treating the first and second regions of pre- 
5 selected sequence under conditions sufficient to result in 
the removal of the uracil base of at least one of the dU 
residues, to thereby form the modified vector having the 

protruding 3* termini. 

The invention further provides a circular nucleic 

10 acid molecule comprising: 

(A) a double-stranded linear or linearized vector 
molecule having two termini, A and B, each having a region 
of pre-selected sequence, and 

(B) a double-stranded desired nucleic acid molecule 
15 having two termini, I and II, each having a region of pre- 
selected sequence, 

wherein the region of pre-selected sequence of a 
first strand of the vector molecule at termini A and the 
region of pre-selected sequence of a second strand of the 

20 desired nucleic acid molecule at termini I are hybridized 
to one another; and 

wherein the region of pre-selected sequence of a 
second strand of the vector molecule at termini B and the 
region of pre-selected sequence of a first strand of the 

25 desired nucleic acid molecule at termini II are hybridized 
to one another. 

The invention further provides a kit specially 
adapted to contain in close compartmentalization a first 
container containing a double-stranded oligonucleotide, 

30 having at least one dU nucleotide at a terminus, of one 
strand, and a second container containing an enzyme 
capable of removing a uracil base of the dU residue. 

The invention also provides the embodiments of the 
above kit which additionally contains a third container 

35 containing a linearized double-stranded vector having at 
least one protruding terminus, the terminus having a 
sequence which is substantially similar to the nucleotide 
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of the dU-containing strand of the 
sequence or tIie 

"^Te 1 — on also provides the events of the 
above kit wherein the terminus is a 3 ■ or •«'*"!""!' 
5 or wherein two termini of the modified desired molecule 
contain the du-oontaining sequence. 

Figure 1 describes the use of two oligonucleotides 
complementary to regions of the sequence tc be amplified 
10 in a FCR amplification process. 

Figure 2 describes an embodiment wherein the exo- 
sample nucleotide is incorporated into one strand of a 
acuble-stranded oligonucleotide. The target 
depicted in Figure 2A. Figure 2B illustrates the 
» modification of the desired molecule so as to result in 
the alteration of the terminus of the molecule. Figure 2C 
shews the production of a protruding 3< *«■?»"•• 

Figure 3 shows an embodiment wherein- the exo 
nucleotide is incorporated into both strand, of m double- 
20 stranded molecule, and used to produce a molecule having 

two modified termini. 

Figure 4A describes an embodiment wherexn the exo 
sample nucleotide is incorporated into one strand of a 
double-stranded oligonucleotide to modify the 5- terminus 
25 of a molecule. The target molecule is depicted xn Figure 
2A. Figure 4B illustrates the removal of the dU- 

containing sequence. 

Figure 5 shows an embodiment wherexn the exo 
nucleotide is incorporated into both strands of a double- 
30 stranded molecule, and used to produce a molecule havxng 
two modified termini. 

Figure 6 shows a depiction of a primer. 

Figure 7 shows the structure that is formed by 
hybridization between the primer and the target sequence 
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by virtue of the homology between the sequences of the 3 • 
hybridizing region and the target molecule. 

Figure 8A shows a depiction of a dU-containing 
primer. Figure 8B shows a depiction of an embodiment 
5 wherein the entire primer contains exonucleotides . Figure 
8C shows a depiction of the hybridized structure formed 
between the target molecule and the primer. 

Figure 9 shows a depiction of an-exonucleotide- 
containing molecule that is in a form which can readily be 
10 readily inserted into a plasmid or other vector. 

Figure 10 shows the use of linkers to produce a 
linearized vector having protruding 3« termini. 

Figure 11 shows the use of PCR and exo-sample 
nucleotides to produce a linearized vector having 

15 protruding 3 • termini . 

Figure 12 shows the structures resulting from the 
removal of the dU residues from modified molecules. 

Figure 13 shows the use of the disclosed method to 
form a circular vector molecule. Figure 13A shows the 

20 modified molecules after destruction of exo-sample 
nucleotide. Figure 13B illustrates the loss of base 
pairing capacity of the region of pre-selected sequence 
after the destruction of the exo-sample nucleotide. 
Figure 13C illustrates the formation of the circular 

25 vector molecule containing the modified desired sequence. 
In Figures 13A and 13B the upper depiction illustrates the 
structure of the modified desired molecule, and the. lower 
depiction illustrates the structure of the modified vector 
molecule . 

30 nttSCRTPTTON OF THE PREFERRED EMBODIMENTS S 
I. TERMS USED IN MOLECULAR BIOLOGY 



In the description that follows/ a number of terms 
used in molecular biology and nucleic acid amplification 
technology are extensively utilized. In order to provide 
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a clearer and consistent understanding of the 
specification and claims, including the scope to be given 
such terms, the following definitions are provided. 

"Amplification", as used herein, refers to any xn 
5 vitro process for increasing the number of copies of a 
betide sequence or sequences. Nuclexc acxd 

amplification results in the incorporation of nucleotides 
into DNA or RNA. PGR is an example of a suitalbe method 
for DNA amplification. As used herein, one amplif xcatxon 
10 reaction may consist of many rounds of DNA replication. 
For example, one PCR reaction may consist of 10-50 
••cycles" of denaturation and replication. 

••Nucleotide" as used herein, is a term of art that 
refers to a base-sugar-phosphate combination. Nucleotides 
15 are the monomeric units of nucleic acid polymers, x.e. of 
DNA and RNA. The term includes ribonucleosxde 
triphosphates, such as rATP, rCTP, rGTP, or 
deoxyribonucleoside triphosphates, such as dATP, dCTP, 
dGTP or dTTP. A "nucleoside" is a base-sugar 
20 combination, i.e. a nucleotide lacking phosphate. 

"Exo-sample nucleotide", as used herein, refers to a 
nucleotide which is generally not found in a sequence of 
DNA. For most DNA samples, deoxyuridine is an example of 
an exo-sample nucleotide. Although the triphosphate form 
25 of deoxyuridine, dUTP, is present in living organisms as 
a metabolic intermediate, it is rarely incorporated xnto 
DNA When dUTP is incorporated into DNA, the resultxng 
deoxyuridine is promptly removed in vivo by normal 
processes, e.g. processes involving the enzyme uracxl DNA 
30 glycosylase (TOG) (Kunlcel, U.S. 4,873,192; Duncan, B.K., 
^ v.nzvmes 212:565-586 (1981), both references herexn 
incorporated by reference in their entirety) . Thus, 
deoxyuridine occurs rarely or never in natural DNA. It xs 
recognized that some organisms may naturally incorporate 
35 deoxyuridine into DNA. For nucleic acid samples of those 
organisms, deoxyuridine would not be considered an exo- 
sample nucleotide. Examples of other exo-sample 



nucleotides include bromodeoxyuridine, 7-methylguanine, 
5 , 6-dihyro-5 , 6 dihydr oxydeoxy thymidine , 3 - 
methyldeoxadenosine, etc. (see, Duncan, B.K., The Enzymes 
3££V:565-586 (1981)). Other exo-sample nucleotides will be 
evident to those in the art. For example, RNA primers 
used for DNA amplifications can be readily destroyed by 
alkali or an appropriate ribonuclease (RNase) * RNase H 
degrades RNA of RNA: DNA hybrids and numerous single- 
stranded RNases are known which are useful to digest 
single-stranded RNA after a denaturation step* 

The presence of deoxyuridine, or any other exo-sample 
nucleotide, may be readily determined using methods well 
known to the art. A nucleic acid molecule containing any 
such exo-sample nucleotide is functionally equivalent to 
DNA containing only dA, dC, dG or dT (dT is referred to 
herein as T) in all respects, except that it is uniquely 
susceptible to certain treatments, such as glycosylase 
digestion. Numerous DNA glycosylases are known to the 
art. An exo-sample nucleotide which may be chemically or 
enzymatically incorporated into an oligonucleotide and a 
DNA glycosylase that acts on it may be used in this 
invention. DNA containing bromodeoxyuridine as the exo- 
sample nucleotide may be degraded by exposure to light 
under well-known conditions. 

The use of exo-sample nucleotides to remove potential 
contaminants from samples being subjected to PCR 
amplification is disclosed by Longo, M.C. et al . . ( Gene 
92:125-128 (1990), Hartley, U.S. Patent No. 5,035,966), 
herein incorporated by reference in their entirety. This 
reference discloses the use of either dU-containing 
oligonucleotides or dUTP in the PCR-directed amplification 
of a target sequence. 

The "desired" or "target" gene sequence or nucleic 
acid molecule is the term used to designate the sequence 
which is to be either amplified, or incorporated into a 
vector (which may be circular or linear) , in order to 
achieve the objectives of the present invention. The 
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15 



20 



25 



30 



35 



sequence may be of any size or complexity. In general 
so Z information is Known about the desired sequence, each 
that the sequences of its termini can be ascer tained Any 
molecule which oan be amplified by PCR, or which has 
restriction sites at its termini can be used as the 
aesired or target sequences of the present invention. A 
-chimeric molecule is a vector (plasmid, cosmid, viral 
nucleic acid, etc.) which has been modified to carry or 
contain the desired gene sequence. 

two sequences are said to be "substantially similar 
in sequence- if they are both able to hybridize to the 
same oligonucleotide. 

The "terminus" of a nucleic acid molecule denotes a 
region at the end of the molecule. The term is not used 
herein as representing the final nucleotide of a Ixnear 
molecule, but rather a general region which is at or near 
an end of a linear or circular molecule. 

Two termini of two nucleic acid molecules are said to 
be the "same denominated termini," if the both termini are 
either the 3' termini of the respective molecules or both 
termini are the respective 5- termini of the respective 
m olecules. As used herein, the term "same denominated 
termini," is not intended to refer to the nucleotide 
sequence of the termini being compared. 

As used here.in, a DNA molecule is said to be 
-circular" if it is capable of depiction as either a 
covalently closed circle, or as a hydrogen bonded circle. 
A circular molecule may thus be composed of one or more 
polynucleotides bonded to one another via covalent or 
hydrogen bonds. The terminal nucleotide(s) of each 
polynucleotide may either be single-stranded, or may be 
bonded to another polynucleotide via covalent or hydrogen 

bonds* ^_„_, 
"Uracil DNA glyoosylas." (DDG) , a term of art, refers 
to an activity which cleaves the glycosidic bend between 
the base uracil and the sugar deoxyribose, only when the 
monomeric nucleotide dUTP is incorporated into a DHA 
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molecule, resulting in incorporation of a deoxyuridine 
moiety (Duncan, B. in The EnSVFtes 1±:565 (1981), ed. : 
Boyer p) . An enzyme possessing this activity does not act 
upon free dUTP, free deoxyuridine, or RNA (Duncan, supra ) ♦ 
The action of UDG results in the production of an "abasic" 
site. The enzyme does not, however, cleave the 
phophodiester backbone of the nucleic acid molecule. Most 
preferably, the phophodiester backbone at an abasic site 
may be claeved through the use of an endonuclease specific 
for such substrates. A preferred enzyme for this purpose 
is the tc. coli enzyme, Endonuclease IV. Most preferably, 
Endonuclease IV is used in conjunction with UDG to remove 
dU residues from a nucleic acid molecule. 

"Incorporating" as used herein, means becoming part 
15 of a nucleic acid polymer. 

"Terminating" as used herein, means causing a 
treatment to stop. The term includes means for both 
permanent and conditional stoppages. For example, if the 
treatment is enzymatic, a permanent stoppage would be heat 
20 denaturation; a conditional stoppage would be, for 
example, use of a temperature outside the enzyme's active 
range. Both types of termination are intended to fall 
within the scope of this term. 

"Oligonucleotide" as used herein refers collectively 
25 and interchangeably, to two terms of art, "oligonucleotide" 
and "polynucleotide" . Note that although oligonucleotide 
and polynucleotide are distinct terms of art, there, is no 
exact dividing line between them and they are used 
interchangeably herein. An oligonucleotide is said to be 
3 0 either an adapter, adapter/ linker or installation 
oligonucleotide (the terms are synonymous) if it is 
capable of installing a desired sequence onto a 
predetermined oligonucleotide. An oligonucleotide may 
serve as a primer unless it is "blocked. «. An 
35 oligonucleotide is said to be "blocked," if its 3- 
terminus is incapable of serving as a primer. 
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.•oligonucleotide-dependent amplification" as used 
herein refers to amplification using an oligonucleotide or 
p^ynucleotide to amplify a nucleic acid sequence, ^n 
Ligonucleotide-dependent amplif ication x. . -V 
5 amplification that requires the presence of one or more 
oligonucleotides or polynucleotides that are two or more 
mononucleotide subunits in length and that end up as part 
of the newly-formed, amplified nucleic acid molecule. 
..Primer" as used herein refers to a 

10 oligonucleotide or a single-stranded polynucleotide that 
Is extended by covalent addition of nucleotide monomers 
Z£ amplification. ..-oleic acid amplification 
based on nucleic acid synthesis by a nucleic acid 
polymerase. «any such polymerases require the °* 

15 I p^er that can be extended to initiate such nucleic 
acid synthesis. A primer is typically 11 bases or longer, 
most preferably, a primer is 17 bases or longer. A 
minimum of 3 bases may, however, suffice. 

"Reaction volume- denotes a liquid suitable for 

20 conducting a desired reaction (such as amplif ication, 
hybridization, cDNA synthesis, etc.). 

A "ligase" is an enzyme that is capable of joining 
the 3- hydroxyl terminus of one nucleic acid molecule to 
a 5- phosphate terminus of a second nucleic acid molecule 

25 to form a single molecule. Wgase enzymes are discussed 
in Watson, J.D., Tn> mlTnlW Bi" l PTY "f the gene , 3rd 
E d., W.A. Benjamin, Inc., Henlo Park, CA (1977), and 

similar texts. 

When an enzymatic reaction, such as a lxgatxon or a 

30 polymerization reaction, is being conducted, it is 
preferable to provide the components required for such 
reaction in "excess" in the reaction vessel- "Excess" xn 
reference to components of the amplification reaction 
refers to an amount of each component such that the 

35 ability to achieve the desired amplification is not 
limited by the concentration of that component. When 
linKer/adapter molecules are used after legation, the 
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excess linker/ adapter present in the reaction is 
preferably separated from the reaction products, or 
removed from the reaction mix, so that they will not 
compete with the cloning of the desired sequence. Use of 
5 linker /adapter oligonucleotides containing dU residues 
allows one to destroy excess linker/adapter molecules by 
enzymatic degradation or other means. 

The methods of the present invention are explained 
partly through illustration. In these illustrations, 
10 sequence pairs, such as A and A' , BandB', C and C, 
X and X', and Y and Y', respectively, etc., are 
complementary to each other. Complementation need not be 
exact; homology sufficient for proper functioning, e.g. 
annealing and priming, will suffice. 

15 II. THE METHODS AND MOLECULES OF THE PRESENT INVENTION 

The present invention employs exo-sample nucleotides, 
most preferably the nucleotide dUTP (which, when incorpor- 
ated into a nucleotide sequence is designated as dU) to 
create a 3' or 5* overhanging extension in the target 
20 nucleic acid molecules. The nucleic acid molecules can be 
derived from PCR, or other methods, or can be isolated 
directly from suitable source materials. 

A MODIFICATION OF EITHER THE 3» OR 5' TERMINI OF 
A DESIRED NUCLEIC ACID MOLECULE USING 
25 LINKER/ADAPTER MOLECULES 

The present invention permits one to modify either 
the 3 • or 5' termini of a desired nucleic acid molecule so 
as to create either a 5' or 3' single-stranded overhanging 
region. The invention accomplishes this goal through the 
30 use of exo-sample nucleotides, preferably dU. In a first 
embodiment, the exo-sample nucleotide is incorporated into 
one strand of a double-stranded oligonucleotide. This 
oligonucleotide is then ligated to a terminus or to both 



termini of the desired molecule. Thus, xf the target 
m olecule is depicted as shown in Figure 2A, then tomodxfy 
the desired molecule so as to produce a protrudxng 3 
terminus, an exo-sam P le nucleotide is ligated to that 
terminus (Figure 2B) . Treatment with UDG results xn the 
removal of the uracil base of the dU residues, thereby 
producing abasic sites. The abasic sites can be cleaved 
with Endonuclease IV, or similar enzymatic actxvxtxes, to 
produce the desired modified molecule (Figure 2C) . 

As will be readily recognized, it is possible to 
modify both termini of the desired molecule through the 
ligation of the dU-containing oligonucleotide to both ends 
of the molecule (Figure 3) . 

in order to modify the desired molecule so as to 
produce a protruding 5- terminus, an exo-sample nucleotide 
is ligated to that terminus (Figure 4A) . As in the above 
embodiment, treatment with UDG results in the removal of 
the uracil base of the dU residues, thereby producing 
abasic sites, which can be cleaved with Endonuclease IV, 
or similar enzymatic activities, to produce the desxred 
modified molecule (Figure 4B) . Again, as in the above 
embodiment, it is possible to modify both termini of the 
desired molecule through the ligation of the dU-containxng 
oligonucleotide to both ends of the molecule (Figure 5) . 

In its most preferred embodiment, however, the 
present invention employs PCR to modify the 5 • termini of 
the desired molecule. 

icnnTPTCATION OF THE 5 • TERMINI OF A DESIRED 
S££?S5 MOLECULE USING PCR AMPLIFICATION 

In a second embodiment for modifying the 5 • termini 
of a desired molecule, so as to permit the production of 
a molecule having overhanging 3- termini, a variation of 
PCR amplification may be used. in this embodiment, the 
desired or target sequences of the present invention are 
modified using PCR so as to cause them to have 5' termxnx 
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which contain at least one, and preferably several exo- 
nucleotide molecules.' For this embodiment, a specialized 
primer is employed. The specialized primer may be added 
at any stage, either initially in the PCR reaction, or 
5 after any number of cycles of amplification. When added 
after one or more cycles of PCR, the initial cycles of 
amplification are conducted using conventional primers. 

C. THE SPECIALIZED PRIMERS OF THE PRESENT INVENTION 

The modification of the termini of the desired 
10 molecule is preferably accomplished using PCR with two 
specialized primers. 

Thus, each primer will be constructed such that it 
contains a 3 • hybridizing region which is complimentary to 
a 5' region of one strand of a desired DNA molecule. The 
15 primers will also contain a region of predetermined and 
pre-selected sequence (whose length is in general of the 
same order of magnitude as the 3» hybridizing region of 
the molecule). Most preferably, the region of pre- 
selected sequence will be approximately 10-20, and most 
20 preferably approximately 12, bases in length. There are 
no constraints with regard to the nucleotide sequence of 
the pre-selected region of the primer molecule. The 
sequence can be either repetitive, palindromic, or unique. 
Each of the primers may thus be depicted as shown in 

25 Figure 6. 

As indicated, the primers will be capable of 
hybridizing to one strand of the target sequence by virtue 
of the homology between the sequences of the 3' 
hybridizing region and the target molecule (Figure 7) . 

30 The 3' hybridizing region of the primer molecules 

need not be complementary to the precise termini of the 
target molecule. Indeed, the target molecule need not be 
a linear molecule. The purposes of the present invention 
will be achieved if the 3' hybridizing region of the 

35 primer molecules is capable of hybridizing to a region 
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which either contains or f lanKs the sequence which is to 

be cloned. . 4. 

A characteristic of the pre-selected sequence is that 

it will contain a nuBher of exo-sample >|d«tU« . 



5 will be preferably interspersed throughout. 

preferred embodiment, the pre-selected sequence xs 12 
Lses long and every third base is a dU. The prxmer may 
thus be depicted as shown in Figure 8A. 

The 3- hybridizing region of the primer may also 
10 contain one or more exo-sample nucleotides 

one embodiment of the invention the entire prxmer contaxns 
exo-sample nucleotides dispersed throughout CFxgure 8B) . 
The hybridized structure formed between the target 
molecule and the primer may thus be depicted as shown xn 

15 Figure 8C. . . . 

The primer or primers are thus incubated xn the 
presence of a sample which contains, or is suspected of 
containing the desired nucleic acid molecule. PGR, or an 
alternative method is then carried out in the manners 

20 described above. After at least one amplification cycle, 
a desired molecule can be produced by permittxng the 
primer-extension molecules to self hybridize. As wxll be 
recognized, the resultant molecule differs from the 
initial desired molecule in two respects. Fxrst, xt 

25 contains at both of its termini additional sequences 
corresponding to the pre-selected sequence regxon. 
second, the pre-selected sequence region at the 5' end of 
both of the amplified strands will contain the exo-sample 
nucleotides of the primer molecule(s) . This molecule xs 

30 in a form which can be readily inserted into a plasmxd or 
other vector in accordance with the methods of the 
invention. The molecule may be depicted as follows shown 
in Figure 9. 
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D. THE VECTOR 



a 



Any procaryotic or eukaryotic plasmid or viral vector 
can be modified to permit its use in accordance with the 
methods of the present invention. Where the vector is 
5 circular molecule, it is first linearized using, for 
example, a restriction endonuclease . The two termini of 
the linearized molecule are then altered to contain a 
sequence complimentary to one or both of the pre-selected 
sequences present in the amplified desired molecule. As 

10 in the case of the above-described primers, the termini of 
the linearized vector molecule can be altered by any of a 
variety of methods. 

In a preferred embodiment, linkers can be used and 
ligated to the ends of the linearized molecule to produce 

15 the desired proruding 3 » termini (Figure 10) . 

Alternatively, PCR can be used to produce linearized 
vector molecules having the suitable termini . The termini 
of the vector are modified to contain an exo-sample 
nucleotide-containing sequence which is complementary to 

20 the exo-sample nucleotide-containing sequence of the 
modified desired molecule. Thus, where the exo-sample 
nucleotide-containing sequence of the modified desired 
molecule has the sequence "X" the exo-sample nucleotide- 
containing sequence, of the modified vector shall have the 

25 sequence "X\ M (Figure 11). 

As will be noted, the effect of these manipulations 
is to produce a linearized molecule having termini which 
contain a pre-selected sequence which is complimentary to 
the pre-selected sequence contained in either or both of 

30 the termini of the modified desired sequence. 
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S THE CLONING OF THE MODIFIED DESIRED SEQUENCE 
INTO THE MODIFIED VECTOR 

The vector molecule and the desired molecule, as 
m odified in the manner described above, are incubated 
5 under conditions which cause the destruction of the exo- 
sample nucleotide. Thus, for example, where the exo- 
sample nucleotide is dU, the molecules are sub D ected to 
treatment with the enzyme UDG. The resulting structures 
are shown below (Figure 12) . As illustrated, treatment 
10 with UDG does not result in the scission of the 
phosphodiester backbone of the nucleic acid molecules 
Rather, it results in the production of abasic sites which 
are thus incapable of base-pairing with complimentary 
sequences. The presence or absence of these abasic site- 
15 containing sequences is irrelevant to the subsequent 
application of the methods of the invention. 

Due to the complimentarity of the modified vector and 
desired molecule sequences, continued incubation of the 
modified vector with the modified desired sequences, after 
20 destruction of the exo-sample nucleotide, permits a 
chimeric molecule to form (Figure 13). 

As will be appreciated, the use of sequences which 
are complimentary to only one end of the modified desired 
molecule permits one to insert the sequence in a um- 
25 directional manner Thus, for example, if one end of the 
modified desired molecule had a region of pre-selected 
sequence X and X' , and the other end of the molecule had 
a region of pre-selected sequence Y and Y ' , it would be 
possible to control the directionality of the insertion 
30 into the modified vector by employing a vector having 
regions of pre-selected sequence X' and X, and Y- and Y, 
respectively. 

Significantly, it is not necessary to remove the 
abasic regions from the circular molecule prior to 
35 transformation into a suitable microbial host. Similarly, 
it is not necessary to treat the circular molecule with a 
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DNA ligase or other agent in order to produce a double- 
stranded covalently closed circular molecule. The 
circular molecule described above can be used directly to 
transform recipient cells. 

The methods of the present invention are especially 
amenable for use in In vitro procedures which utilize 
enzymes to amplify specific nucleic acid sequences and 

especially to PCR. 

The present invention includes articles of 
manufacture, such as "kits." Such kits will, typically, 
be especially adapted to contain in close 

compartmentalization an instructional brochure, and a 
circular, or more preferably, linearized vector molecule 
whose 5' termini contain a region of pre-selected sequence 
15 which contains at least one exo-sample nucleotides. In a 
second embodiment, the kit will contain a modified vector 
molecule in which the exo-sample nucleotide has been 
destroyed to produce a protruding (i.e. overhanging) 3' 
terminus, or equivalently, a non-recessed 5* terminus 
which is incapable of base pairing with a complementary 
sequence . 

In sub-embodiments of the above embodiments, the kit 
may also contain a container containing an exo-sample 
nucleotide-containing oligonucleotide suitable for use as 
a primer for modifying the termini of a desired nucleic 
acid molecule and/ or a container which contains an enzyme 
capable of degrading an oligonucleotide which contains the 
exo-sample nucleotide. The kit may additionally contain 
buffers, enzymes, and the like. 
30 Having now generally described the invention, the 

same will be more readily understood through reference to 
the following examples which are provided by way of 
illustration, and are not intended to be limiting of the 
present invention, unless specified. 



20 
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EXAMPLE 1 



10 



15 



20 



Enzymes and reagents. 

Taq DNA polymerase was purchased from Perkxn Elmer- 
Cetus; dNTPs were from Boehringer Mannheim. Competent 
bacteria (DH10B) , proteinase K, and restriction enzymes 
were from BRL. Oligonucleotides were synthesized using an 
ABI-380A DNA synthesizer and 
Amplification of vector and human cosmid DNA. 

All PGR reactions were 50 microliters covered with 
mineral oil using the following final buffer 
concentration: 50 mM KCl, 10 mM Tris-HCl ( P H 8.4), 1.5 mM 
MgCl, , and 0.2 mM of each dNTP. A Perkin Elmer-Cetus 
thermal cycler was used to generate Alu-PCR products as 
well as to analyze the inserts from subclones. After an 
initial 5 minutes at 93°, 35 cycles of 1 minute at 60°, 1 
minute at 72° and 1 minute at 93° were used. An additional 
5 minutes at 72° was used for the last cycle. Twenty to 30 
ng of each of the four Notl-linearized cosmids was 
amplified; 1 ng of Xbal-linearized P UC119 (5) was 
amplified as described above. Products from PGR reactxons 
were analyzed by 1 % agarose gel electrophoresis in TAE 
buffer with ethidium bromide. 



UDG treatment. 

The vector and Alu-PCR products were precipitated 

25 with ethanol and dissolved in the following buffer {25 mM 
Tris-HCl [ P H 7.8], 10 mM MgCl 2 , 4 mM betamercaptoethanol, 
0.4 mM ATP). Single-stranded 3- overhangs consisting of 
10 nucleotides in the vector and 11 nucleotides in the 
Alu-PCR products were made by treating vector (225 ng) and 

30 Alu-PCR products (110 ng to 212 ng) each separately with 
UDG (BRL) in a final volume of 10 microliters for 10 
minutes at 37°. Initial experiments used 16 units of UDG, 
however as little as 1 unit has been found to be 
sufficient. A 10 minute treatment at 65° was used 

35 following the UDG treatment. 
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Cloning and transformation. 

UDG treated vector (45 ng) was combined with UDG 
treated Alu-PCR reaction products (45 ng to 106 ng) in a 
final volume of 20 microliters in the above Tris, MgCl 2 , 
5 betamercaptoethanol, ATP buffer for one hour at room 
temperature. Five microliters from each combination were 
transformed in 50 microliters of DH10B competent cells 
(BRL) following the manufacturers recommendations , and 
plated onto LB plates containing ampicillin, X-gal and 
10 IPTG. 

PGR analysis of transf ormants. 

Subclones were analyzed by PGR using the Alu primer. 
Single white colonies were dispersed into 12 microliters 
of 10 mM Tris-HCl (pH 7.5) , 1 mM EDTA, 50 micrograms per 

15 ml proteinase K and incubated at 55° for 15 minutes, 80° 
for 15 minutes, and chilled on ice. PCR components 
including the Alu primer were added and amplified for 30 
cycles using the above protocol. Five microliters of each 
analysis was run on an agarose gel for sizing. 

20 Analysis of the transf ormants obtained using this 
procedure showed efficient cloning of PCR products using 
the exo-sample nucleotide cloning method . Control 
reactions where the insert DNA, or the vector DNA or the 
UDG treatment had been omitted resulted in no 

25 transf ormants, indicating that the cloning method was 
dependent on the procedure outlined and embodied ih this 
application. 
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vm^v is P T,ai;MED IS; 
1. 



i. A method for incorporating a double-stranded linear 
desired nucleic acid molecule into a double-stranded 

vector, comprising: 

(A) forming a edified desired nucleic add molecule 
characterized in possessing a first region of P^f"** 
seguence at at least one terminus of a first strand, saxd 
sequence containing at least one du residue; 

<B> treating said first region of pre-selected sequence 
under conditions sufficient to result in the removal of 
the uracil base of at least one of said dU rescues, to 
thereby form a protruding terminus capable of hydrogen 
bonding to a complementary seguenceboth of said strands, 
on at least one strand of said modified desired molecule, 

(C) incubating said modified molecule (B) in the 
presence of a modified vector having at least one 
protruding single-stranded terminus, and being capable of 
hydrogen bonding to at least one of said 
terminus of said modified desired DNA molecule, to thereby 
incorporate said double-stranded linear desired nucleic 
acid molecule into said double-stranded vector. 



2 The method of claim 1 wherein only one terminus of 
said modified desired molecule contains said du-containing 

sequence • 

3 . The method of claim 2 wherein said terminus is a 3 • 
terminus of said modified desired molecule. 

4. The method of claim 2 wherein said terminus is a 5' 
terminus of said modified desired molecule. 
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5. The method of claim 1 wherein two termini of said 
modified desired molecule contain said dU-containing 
sequence . 

6. The method of claim 5 wherein both of said termini 
are 3» termini of said modified desired molecule. 

7 . The method of claim 5 wherein both of said termini 
are 5' termini of said modified desired molecule. 

8. The method of claim 1 wherein said termini of said 
first and second strands of said desired DNA molecule 
contain a plurality of dU residues. 

9 . The method of claim 1 wherein in step (B) , said dU 
residues are treated with UDG under conditions sufficient 
to remove the uracil base of at least one of said dU 
residues, to therby form an abasic site. 

10. The method of claim 9 wherein step (B) additionally 
comprises treating said abasic site with Endonuclease IV 
under conditions sufficient to cleave the modified desired 
molecule at said abasic site. 

11. The method of claim 1 wherein said regions of pre- 
selected sequence of said modified desired DNA molecule 
are identical. 

12. The method of claim 1 wherein in step (C) , said two 
protruding single-stranded termini are produced through 

25 the action of a restriction endonuclease. 



15 



20 



13. The method of claim 1 wherein in step (C) , said two 
protruding single-stranded termini are produced through 
the ligation of an oligonucleotide to said vector. 
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14 . The method of claim 1 wherein in step (C) , said two 
protruding single-stranded termini are produced by 
fl) adding to said vector: 

(i) a first region of pre-selected sequence at 
5 a 5- terminus of a first strand, said sequence containing 

at least one dU residue; 

(ii) a second region of pre-selected sequence 
at a 5' terminus of a second strand, said sequence 
containing at least one dU residue; and 
10 (ID treating said first and second regxons of pre- 

selected sequence under conditions sufficient to result xn 
the removal of the uracil base of at least one of saxd dU 
residues, to thereby fcrm said modified vector havxng saxd 
protruding 3* termini. 

is 15 A circular nucleic acid molecule comprising: 

(A) * a double-stranded linear or linearized vector 
molecule having two termini, A and B, each having a region 
of pre-selected sequence, and 

(B) a double-stranded desired nucleic acid molecule 
having two termini, I and II, each having a region of pre- 
selected sequence, _. . 

wherein the region of pre-selected sequence of a f xrst 
strand of said vector molecule at termini A and the regxon 
of pre-selected sequence of a second strand of saxd 
desired nucleic acid molecule at termini I are hybrxdxzed 

to one another; and 

wherein the region of pre-selected sequence of a second 
strand of said vector molecule at termini B and the region 
of pre-selected sequence of a first strand of said desired 
nucieic acid molecule at termini II are hybridized to one 



20 



25 



30 



another. 



35 



16 A kit specially adapted to contain in close 
compartmentalization a first container containing a 
double-stranded oligonucleotide, having at least one dU 
nucleotide at a terminus of one strand, and a second 
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container containing an enzyme capable of removing a 
uracil base of said dU residue, 

17. The kit of claim 16 which additionally contains a 
third container containing a linearized double-stranded 

5 vector having at least one protruding terminus, said 
terminus having a sequence which is substantially similar 
to the nucleotide sequence of said dU-containing strand of 
said oligonucleotide. 

18. The kit of claim 16 wherein said terminus is a 3 • 
10 terminus. 

19. The kit of claim 16 wherein said terminus is a 5' 
terminus . 
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